The nine systems of the CLUSTER-Prime dataset.


1.The description of the CLUSTER-Prime dataset 

 (1) *CLASS_relationInfo_whole.ser*: The whole code dependence relation. You can use it to get code regions by setting thresholds of call dependence and data dependence. 

 (2) *normalized-uc*:  The documents of both requirements and classes are normalized by standard pre-processing techniques including splitting identifiers, special token elimination, stemming, and stop word removal.

 (3) *RTM_CLASS*: Requirements-to-code Trace Matrices.


2. Links to download source code of nine systems

 (1) iTrust: https://sourceforge.net/projects/itrust/files/itrust/13.0/

 (2) Maven: https://github.com/apache/maven
 
 (3) Pig: https://github.com/apache/pig

 (4) Infinispan: https://github.com/infinispan/infinispan

 (5) GanttProject: https://github.com/bardsoftware/ganttproject

 (6) Seam2: http://www.seamframework.org/Seam2.html

 (7) Drools: https://github.com/kiegroup/drools

 (8) Derby: https://github.com/apache/derby

 (9) Groovy: https://github.com/apache/groovy



